M3G: Maximum Margin Microarray Gridding
Abstract
Background: Complementary DNA (cDNA) microarrays are a well-established technology for studying gene expression. A microarray image is obtained by laser scanning a hybridized cDNA microarray, which consists of thousands of spots representing chains of cDNA sequences, arranged in a two-dimensional array. The separation of the spots into distinct cells is widely known as microarray image gridding.
Methods: In this paper we propose M3G, a novel method for automatic gridding of cDNA microarray images based on maximizing the margin between the rows and the columns of the spots. First, the rotation of the microarray image is estimated, and a pre-processing algorithm is applied for rough spot detection. To diminish the effect of artefacts, only a subset of the detected spots is selected by matching the distribution of the spot sizes to the normal distribution. Then, a set of grid lines is placed on the image to separate each pair of consecutive rows and columns of the selected spots. The optimal position of each line is determined by maximizing the margin between these rows and columns with a maximum margin linear classifier, which facilitates the localization of the spots.
Results: The experimental evaluation was based on a reference set of microarray images containing more than two million spots in total. The results show that M3G outperforms state-of-the-art methods and is robust in the presence of noise and artefacts. More than 98% of the spots reside completely inside their respective grid cells, and the mean distance between the spot center and the grid cell center is 1.2 pixels.
Conclusions: The proposed method performs highly accurate gridding in the presence of noise and artefacts, while taking into account the rotation of the input image. Thus, it has the potential to achieve perfect gridding for the vast majority of the spots.
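The grid-line placement step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the image rotation has already been corrected, so that each grid line is axis-aligned and the maximum margin problem reduces to one dimension. In that special case the widest-margin separator between two consecutive rows lies midway between the closest spot centers of the two rows. The function name `grid_lines_1d` and the sample coordinates are hypothetical.

```python
def grid_lines_1d(rows):
    """Place one horizontal grid line between each pair of consecutive rows.

    `rows` is a list of rows ordered top to bottom; each row is a list of
    the y-coordinates of its spot centers. Returns (position, margin)
    pairs, where `margin` is the full width of the empty band between
    the two rows.
    """
    lines = []
    for upper, lower in zip(rows, rows[1:]):
        top_edge = max(upper)      # lowest-lying spot of the upper row
        bottom_edge = min(lower)   # highest-lying spot of the lower row
        if bottom_edge <= top_edge:
            raise ValueError("rows overlap; no hard-margin separator exists")
        position = (top_edge + bottom_edge) / 2.0
        lines.append((position, bottom_edge - top_edge))
    return lines


# Hypothetical spot-center y-coordinates for three rows.
rows = [[10.0, 11.5, 9.8], [30.2, 29.0, 31.0], [50.5, 49.5, 50.0]]
lines = grid_lines_1d(rows)
for position, margin in lines:
    print(f"grid line at y={position}, margin={margin}")
```

When residual rotation remains, the separating line is no longer axis-aligned and a two-dimensional linear classifier is needed, which is the case the abstract addresses.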
Similar resources
M3G: Maximum Margin Microarray Gridding
margin between spots of distinct rows or columns. The width of the margin is equal to $2/\lVert \bar{w} \rVert$, therefore the widest margin is found by minimizing $\lVert \bar{w} \rVert$ under the constraints $c_i(\bar{w} \cdot \bar{x}_i - b) \ge 1$, i.e. requiring that all the spots lie on the correct side of the resulting grid line. The support vector machine described above is called a hard-margin SVM and does not take into account any outliers. One o...
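As a small worked example of the hard-margin constraints quoted above, the sketch below checks a candidate line against $c_i(\bar{w} \cdot \bar{x}_i - b) \ge 1$ and computes the margin width $2/\lVert \bar{w} \rVert$. The spot coordinates, labels, and the helper names `margin_width` and `satisfies_hard_margin` are assumptions made for illustration; the code only verifies a given $(\bar{w}, b)$ and does not solve the optimization problem.

```python
import math


def margin_width(w):
    """Width of the margin, 2 / ||w||, for a 2D weight vector w."""
    return 2.0 / math.hypot(*w)


def satisfies_hard_margin(w, b, spots, labels, tol=1e-9):
    """True if every spot obeys c_i * (w . x_i - b) >= 1 (up to tol)."""
    return all(
        c * (w[0] * x + w[1] * y - b) >= 1.0 - tol
        for (x, y), c in zip(spots, labels)
    )


# Hypothetical spot centers: one row at y=10 (label -1), one at y=30 (label +1).
spots = [(5.0, 10.0), (25.0, 10.0), (5.0, 30.0), (25.0, 30.0)]
labels = [-1, -1, 1, 1]

# Candidate grid line w . x - b = 0, i.e. 0.1 * y - 2 = 0, or y = 20.
w, b = (0.0, 0.1), 2.0
print(satisfies_hard_margin(w, b, spots, labels))  # constraints hold
print(margin_width(w))                             # gap between the rows

# A smaller ||w|| would imply a wider margin, but it violates the constraints.
print(satisfies_hard_margin((0.0, 0.05), 1.0, spots, labels))
```

Here the line y = 20 touches the constraints with equality on both rows, so no smaller $\lVert \bar{w} \rVert$ is feasible and the margin equals the full 20-pixel gap between the rows.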
Unsupervised SVM-based gridding for DNA microarray images
This paper presents a novel method for unsupervised DNA microarray gridding based on support vector machines (SVMs). Each spot is a small region on the microarray surface where chains of known DNA sequences are attached. The goal of microarray gridding is the separation of the spots into distinct cells. The positions of the spots on a DNA microarray image are first detected using image analysis...
A Microarray Image Gridding Method Based on Projection Transformation and Power Spectral Analysis
Microarray image gridding is an important step in microarray image analysis that determines the 2D image coordinates of all array spots in the hybridized gene chip image. The accuracy of image gridding affects the reliability of gene-chip data extraction and even the final analysis results of gene-chip assays. To improve microarray image gridding accuracy and computational efficiency, we presented a mi...
Automatic Gridding Method for Microarray Images
A cDNA microarray is a powerful tool in biotechnology providing useful information in analyzing thousands of gene expressions simultaneously. The analysis of microarray images allows the identification of gene expressions to draw biological conclusions for applications ranging from genetic profiling to diagnosis of cancer. The DNA microarray image analysis includes three tasks: gridding, segmen...